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at  one  symbol  per  1.5  sec,  .75  sec,  and  .50  sec.  During  the  first  three  tests,  the  subjects 
were  instructed  to  press  the  correct  key  and  not  verbalize  the  number.  During  the  next 
three  tests,  the  subjects  verbalized  the  number  and  did  not  press  the  key.  During  the  last 
three  tests,  the  subjects  verbalized  the  number  and  simultaneously  pressed  the  correct 
key.  At  the  end  of  each  block  of  three  tests,  the  subjects  estimated  self-performance  as 
percent  correct  and  rated  stress  on  a  scale  of  one  (no  stress)  to  seven  (high  stress).  Voice 
signals  were  initially  recorded  on  magnetic  tape,  then  processed  through  filtering  circuits 
and  displayed  on  a  strip  chart  for  subsequent  visual  analysis  and  interpretation.  A 
subjective  scoring  criterion  was  established  and  then  translated  into  electronic 
equivalents  and  automated  on  a  Varian  73  computer  for  voice  pattern  recognition  analysis. 
Significant  main  effects  for  percent-correct  responses  were  obtained  for  groups, 
presentation  rate,  and  groups  X  presentation  rate  interaction.  No  significant  differences 
were  found  in  the  correct  responses  of  the  subjects  when  the  number  was  verbalized  or 
not  verbalized.  The  Staff  group  produced  significantly  fewer  correct  responses  than  either 
the  Jet  or  Prop  groups  at  the  .75  sec  rate.  Voice  stress  analysis  showed  significant 
correlations  with  performance  scores  and  stress  ratings  of  a  selected  pool  of  subjects 
(N=12).  The  results  were  discussed  as  to  the  potential  application  of  an  objective,  reliable, 
sensitive,  and  nonob trusive  measure  of  stress  in  vocal  communication  systems  that  require 
operator  workload  assessments. 
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PREFACE 


This  technical  memorandum  documents  a  study  conducted  at  the  Naval  Air  Test 
Center  from  February  1978  to  December  1979.  The  technical  investigation  was 
conducted  as  a  continuing  effort  of  the  Aircrew  Systems  Branch  to  support 
development  of  Human  Factors  test  and  evaluation  methodology.  Specifically,  the 
project  was  initiated  because  of  a  recommendation  to  quantitatively  determine 
operator  workload  levels  in  high  stress  environments  (reference  NAVAIRTESTCEN 
RW-9R-78). 
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INTRODUCTION 


NEED 

1.  A  need  exists  for  a  general  stress  measure  which  is  nonob trusive  and  can  be 
applied  in  actual  in-flight  operational  situations  without  interfering  with  operator 
performance.  Ideally,  the  technique  should  be  objective  and  quantifiable,  simple  to 
collect  and  analyze,  reliable  across  individuals,  and  valid  for  specific  aircrew 
operational  test  situations. 

BACKGROUND 

2.  In  several  recent  surveys  of  potential  operator  workload  techniques  (Gartner  & 
Murphy,  1976;  Harrison,  1977;  Roscoe,  1978;  and  Wierwille,  Williges  &  Schiflett, 
1979),  it  was  found  that  no  single  technique  could  be  recommended  as  the 
definitive  measure  of  operator  workload  because  of  the  multidimensionality  of 
stress  inherent  in  an  operational  setting.  An  additional  conclusion  reached  by 
Schiflett  (1976)  was  that  the  majority  of  techniques  developed  were  used  in  the 
design  stage  of  aircrew  systems,  thereby  making  them  difficult  and/or  impractical 
to  implement  in  the  later  stages  of  operational  test  and  evaluation. 

3.  However,  it  is  very  apparent  from  observational  data  that  performance  often 
involves  some  form  of  verbal  communication  associated  with  completion  of  a  task. 
Speech  may  be  the  common  denominator  underlying  the  search  for  a  general  stress 
measure  which  is  nonintrusive  in  the  operational  testing  environment.  If  it  can  be 
demonstrated  that  stress  is  manifested  in  the  acoustic  speech  signal,  it  may  be 
possible  to  develop  an  automatic  measurement  technique  for  analyzing  a  speaker’s 
vocalizations  to  determine  when  and  to  what  degree  stress  is  present. 

4.  Previous  research  (Lieberman,  1961;  Lieberman  &  Michaels,  1962;  and  Bell, 
Ford  &  McQuiston,  1972)  in  the  field  of  voice  analysis  has  indicated  that  the 
psychophysiological  state-of-the-speaker  may  manifest  itself  in  the  acoustic 
domain  by  changes  in  the  frequency  modulation  (8  to  12  Hz)  of  the  fundamental 
frequency  (50  to  250  Hz)  of  the  voice.  An  electronic  instrument  called  the 
Psychological  Stress  Evaluator  (PSE)  was  used  to  detect,  analyze,  and  display  these 
frequency  changes  in  the  inaudible  microtremors  of  the  muscles  that  surround  the 
vocal  cords.  The  damping  of  these  muscular  undulations  is  reported  to  be 
associated  with  mild  tension  by  Lippold  (1971),  Heisse  (1974),  and  Inbar  (1976)  even 
though  there  is  some  question  to  the  origin  of  the  physiological  mircotremors 
(McGlone,  1975).  However,  in  a  thorough  review  of  the  research  associated  with 
speech  patterns  in  aircrew  workload,  Cannings  (1979)  recommended  that  the  use  of 
microtremor  would  be  a  useful  approach  and  warrants  further  exploratory  investi¬ 
gations. 

5.  The  majority  of  validation  studies  reviewed  by  Kubis  (1974),  Edson  (1976), 
Heisse  (1976),  and  Hirsch  &  Wiegele  (1979)  was  conducted  in  clinical  settings, 
laboratory  deception-detection  experiments,  or  actual  criminal  case  histories 
involving  lie  detection  of  known  solutions  either  by  confession  or  court  decisions. 
The  results,  with  the  exception  of  the  Kubis  report,  generally  support  the  validity 
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of  the  instrument  to  detect  stress  associated  with  deception.  However,  several 
serious  methodological  problems  are  discussed  by  Brenner  &  Branscomb  (1979) 
which  directly  affect  its  use  in  lie  detection  settings. 

6.  The  PSE  instrument  has  also  been  used  in  a  limited  quantity  by  a  wide  variety 
of  applications  other  than  lie  detection.  Several  annotated  bibliographies  are 
available  from  Dektor,  Incorporated,  that  provide  the  reader  with  a  good  review  of 
nondeception  research  studies  ranging  from  clinical  applications  to  political  speech 
analysis.  An  attempt  will  not  be  made  to  review  all  of  these  nondeception  studies 
but  only  those  that  use  a  research  paradigm  or  actual  field  study  to  induce  task- 
related  stress  in  individuals. 

7.  A  series  of  preliminary  studies  was  conducted  by  Planar  Corporation  (1972)  to 
test  the  feasibility  of  using  the  PSE  to  measure  stress  attributable  to  workload 
imposed  by  task  demands  and/or  perceived  risk.  Even  though  the  sample  sizes  were 
small  and  the  task  situations  were  not  under  experimental  control,  the  findings  did 
suggest  that  the  PSE  could  be  used  to  quantify  vocal  stress  in  aircraft  accident 
investigations,  air  traffic  controller  messages,  and  astronaut  communications 
during  space  missions.  An  in-depth,  follow-up  study  by  Older  &  Jenney  (1975) 
analyzed  voice  communications  of  Skylab  astronauts  engaged  in  operational  tasks 
of  varying  degrees  of  difficulty,  e.g.,  conduct  of  solar  observations  and  study  of 
earth  resources.  Although  some  statistically  significant  relationships  were  found, 
the  voice  analysis  technique  was  not  judged  to  be  sufficiently  predictive  to  warrant 
its  use  in  assessing  the  degree  of  psychological  stress  of  crew  members  in  future 
space  missions. 

8.  Favorable  validation  evidence  from  a  battery  of  psychophysiological  measures 
has  been  reported  by  Borgen  &  Goodman  (1976)  of  the  Parke-Davis  laboratories. 
The  results  yielded  systematic  changes  in  PSE  scores  to  the  stroop  color/word 
conflict  task.  The  PSE  scores  also  correlated  well  with  changes  in  heart  rate,  blood 
pressure,  skin  potential,  and  forearm  blood  flow. 

9.  Brenner,  Branscomb  &  Schwartz  (1979)  tested  the  PSE  on  a  mental  arithmetic 
task  which  had  previously  produced  graded  changes  in  heart  rate,  pupil  dilation,  and 
skin  resistance  (Kahnenan,  Tursky,  Shapiro,  &  Crider,  1969).  The  subjects 
performed  a  series  of  mental  arithmetic  problems  which  varied  in  difficulty  but 
which  had  a  fixed  execution  time.  The  PSE  scores  increased  directly  with  task 
difficulty  and  paralleled  error  data,  self-report  data,  and  previous  psychophysio¬ 
logical  evidence.  Although  shortcomings  of  the  PSE  scores  were  noted,  especially 
subjectivity  in  scoring  and  response-word  artifacts,  the  overall  results  suggest  the 
basis  for  a  practical  measure  of  stress. 

10.  Encouraged  by  these  results,  Branscomb  (1979)  and  Brenner,  Branscomb,  & 
Wright  (1979)  developed  a  computer-based  measure  of  vocal  stress  by  automating 
the  PSE  scoring  procedures  and  correlating  the  results  with  pupil  diameter  changes 
while  performing  a  mental  arithmetic  task.  A  multiple  regression  coefficient  of 
r  =  .39  was  obtained  by  combining  all  nine  vocal  measures  to  predict  task  workload. 
Coefficients  for  individual  subjects  were  higher,  ranging  in  prediction  from  11%  to 
45%  of  the  available  variance.  The  researchers  concluded  that,  given  the 
preliminary  nature  of  the  present  vocal  measure  algorithms,  the  findings  are 
clearly  encouraging  and  warrant  further  investigation. 
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PURPOSE 

11.  The  paper  documents  a  study  presented  by  Schiflett  (1979)  that  attempted  to 
determine  if  the  PSE  could  be  used  to  detect  the  amount  of  situational  stress  in  the 
voice  while  subjects  performed  a  four-choice  information  processing  task  at 
different  presentation  rates.  The  purpose  was  to  evaluate  the  relationship  between 
performance  scores,  self-rated  stress,  and  measured  vocal  stress  due  to  increases 
in  information  presentation  rates. 
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METHOD 

SUBJECTS 

12.  The  subject  pool  consisted  of  31  male  Naval  officers  and  11  male  civilians,  all 
college  graduates  between  the  ages  of  24  and  39*  At  the  time  of  testing,  the 
participants  were  all  enrolled  in  a  graduate  degree  program  sponsored  by  the  Naval 
Air  Test  Center,  Patuxent  River,  Maryland. 

DESIGN 

13.  The  effects  of  presentation  rate  on  performance  and  stress  were  investigated 
using  a  3  X  3  X  2  Analysis  of  Variance  (ANOVA)  factoral  design  (Kirk,  1968)  and 
nonparametric  statistics  (Siegel,  1956).  The  military  officers  were  divided  into  two 
groups  based  on  the  type  of  flight  squadron  experience  (i.e.,  attack/fighter  or 
multi-engine/helicopter).  The  third  group  was  composed  of  military  officers  in 
staff  (nonflight)  billets  and  civilian  personnel.  The  groups  were  designated  as 
group  I  -  Jet,  group  II  -  Prop,  and  group  HE  -  Staff.  Subjects  were  randomly 
assigned,  within  each  group,  to  the  order  of  tests. 

14.  A  subgroup  of  12  subjects  was  selected  for  vocal  stress  analysis  out  of  the 
larger  (N=42)  pool  of  subjects.  The  criterion  for  selection  was  the  rank  order  of 
percent-correct  scores,  i.e.,  four  highest,  four  lowest,  and  four  middle-ranked 
scores  averaged  across  all  presentation  rates.  These  subjects  were  selected 
irrespective  of  group  identity. 

TASK/EQUIPMENT 

15.  The  psychomotor  tests  were  displayed  to  the  subject  and  automatically  scored 
on  a  Response  Analysis  Tester  (RATER).  The  RATER  presented  a  four-choice 
discrimination  task  in  which  the  subject  was  required  to  match  a  response  key  to 
each  of  four  stimuli  (numbers  -  one,  two,  three,  and  four)  appearing  in  a  display 
window. 

16.  The  sequence  of  stimuli  was  randomly  presented  in  an  automatic-paced  mode 
for  nine  1-min  tests.  Each  subject  received  three  different  presentation  rates 
under  three  response  conditions.  The  stimuli  presentation  rates  were  one  symbol 
per  1.5  sec,  .75  sec,  and  .50  sec  for  1  min  each.  During  Condition  1  (key  presses 
only),  the  subjects  were  instructed  to  press  the  correct  key  and  not  vocalize  the 
number.  During  Condition  2  (vocal  only),  the  subjects  vocalized  the  number  and  did 
not  press  the  key.  During  Condition  3  (key  presses  and  vocalizations),  the  subjects 
vocalized  the  number  and  simultaneously  pressed  the  correct  key.  All  conditions 
were  randomized  to  control  order  effects.  At  the  end  of  three  tests  of  each 
condition,  the  subjects  estimated  self-performance  as  percent  correct  and  rated 
stress  on  a  scale  of  one  (no  stress)  to  seven  (high  stress)  for  each  presentation  rate. 
The  subjects  were  allowed  as  many  practice  sessions  as  needed  to  reach  a  criterion 
of  90%  correct  response  at  the  1.5-sec  rate  for  Condition  3  (key  pressed  and 
vocalization).  All  subjects  reached  the  criterion  within  three  1-min  sessions.  After 
practice,  a  1-min  pretest  was  given  using  a  self-paced  mode  feature  that  allowed 
the  subject  to  control  the  presentation  rate  of  the  stimuli  by  responding  as  quickly 
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and  accurately  as  possible.  The  pretest  was  used  as  a  measure  of  each  individual’s 
baseline  responding  rate.  Also,  a  self-pacing  posttest  was  given  at  the  end  of  the 
nine  1-min  tests  to  determine  any  differences  in  mean  deviations  from  pretest 
baseline  conditions. 

17.  The  voice  was  analyzed  by  a  PSE  manufactured  by  Dektor,  Incorporated, 
developed  specifically  as  a  deception-detection  instrument.  The  device  consists  of 
a  signal  analyzer,  a  strip  chart  pen  recorder,  a  magnetic  tape  recorder,  and  a 
microphone.  Voice  signals  were  initially  recorded  on  magnetic  tape,  then  processed 
through  the  analyzer  circuits,  and  displayed  on  a  strip  chart  for  subsequent  visual 
analysis  and  interpretation. 

SCORING  STRATEGY 

18.  Interpretation  of  the  strip  chart  tracing  is  typically  accomplished  by  visual 
examination  of  the  recorded  signal  for  specific  patterns.  Comprehensive  training  is 
necessary  to  identify  the  most  important  characteristics  of  each  pattern  in  order 
to  relate  the  output  signals  to  a  level  of  stress.  All  PSE  voice  charts  were  coded  by 
group  number,  presentation  rate,  and  response  condition,  i.e.,  with  or  without  key 
presses.  A  scoring  criterion  of  waveform  uniformity  was  developed  from  the  coded 
charts  by  two  interpreters  trained  to  recognize  stress  patterns  and  response  word 
artifacts  in  the  PSE  charts.  The  scoring  criterion  of  uniformity  was  quantified  by 
simulating  vocal  stress  patterns  as  shown  in  figure  la.  These  patterns  were 
generated  by  an  analogue  waveform  synthesizer  and  fed  into  a  Varian  73  computer 
via  an  analogue-to-digital  converter.  The  peak-to-peak  displacement  scoring 
algorithm  was  detected  by  sampling  at  2,000  times-per-second.  The  sample 
waveforms  were  scored  and  ranked  for  degree  of  stress/uniformity  that  yielded  a 
scale  from  0  (low  stress/uniformity)  to  10  (high  stress/uniformity).  The  uniformity 
scale  was  then  used  to  quantify  the  actual  measured  vocal  stress  patterns  for  each 
selected  test  subject. 

19.  Representative  samples  of  actual  vocal  patterns  indicating  a  range  of  stress 
are  shown  in  figure  lb.  Notice  the  degree  of  change  in  the  uniformity,  i.e.,  peak- 
to-peak  differences  of  the  sample  waveforms.  The  lack  of  uniformity  or  more 
randomization  in  the  pattern  indicates  less  stress  manifested  in  the  vocal  output  of 
the  subject.  These  uniformity  changes  are  quantified  in  units  of  stress  as  shown  in 
the  sample  patterns. 

20.  The  automated  scoring  technique,  as  shown  in  more  detail  in  appendix  A,  was 
used  to  measure  the  degree  of  stress  in  the  waveform  pattern  identified  by  the  PSE 
from  the  vocal  utterances  of  each  selected  test  subject.  The  measurement 
technique  provided  a  reliable  and  expedient  method  of  scoring  large  quantities  of 
data.  For  example,  each  of  the  selected  12  subjects,  for  nine  1-min  tests  and  the 
pre-posttests  yielded  approximately  850  vocal  utterances  to  be  analyzed.  Since  the 
stimulus  numbers  were  presented  randomly  within  each  presentation  rate,  a 
sampling  method  was  devised  to  eliminate  approximately  one-half  of  the  data  by 
selecting  every  other  utterance  at  the  .75  sec/symbol  rate  and  every  third 
utterance  at  the  .50  sec/symbol  rate.  All  vocalizations  at  the  slow  rate  of 
1.5  sec/symbol  and  the  pre-post  conditions  were  analyzed  without  sampling. 
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Figure  la 

Simulated  Vocal  Stress  Patterns 
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Figure  lb 

Measured  Vocal  Stress  Units 
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RESULTS 

PERFORMANCE  SCORES  (%  CORRECT  RESPONSES) 

21.  The  data  for  percent-correct  responses  for  all  subjects  under  each  experi¬ 
mental  condition  are  presented  in  figure  2.  Significant  main  effects  were  obtained 
from  ANOVA  for  groups  (F  =  6.02,  df  =  2/234,  p  <  .05)  and  presentation  rate 
F  =  1308.27,  df  =  4/234,  pc. 05).  No  significant  differences  were  found  in  the 
correct  responses  of  the  subjects  when  the  number  was  vocalized  or  not  vocalized 
(F  =  0.41,  df  =  1/234,  p  >.05). 
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JET  PROP  STAFF 

GROUP  GROUP  GROUP 

nn  _ _ 
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Figure  2 

Percent  Correct  Key  Presses 
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22.  Tests  of  simple  main  effects  produced  significant  differences  across  groups  at 
each  presentation  rate.  A  series  of  Tukey's  mean  comparison  tests  revealed  that 
presentation  rate  means  were  all  significantly  different  from  each  other  within 
each  group. 

23.  No  differences  were  obtained  between  groups  except  at  the  .75-sec  presenta¬ 
tion  rate.  The  Staff  group  (X  =  62.93%)  produced  significantly  lower  correct 
responses  than  the  Jet  (X  =  72.31%)  or  Prop  (X  =  77.40%)  groups.  There  were  no 
significant  differences  in  percent  correct  scores  between  the  Jet  and  Prop  groups. 

24.  The  percent  correct  estimates  by  the  subjects  did  not  show  any  significant 
differences  across  groups  or  response  conditions.  The  estimates  of  performance 
were  significantly  affected  by  presentation  rate.  Performance  was  consistently 
estimated  to  be  lower  than  actual  percent-correct  scores  at  the  1.5-sec  and 
.75-sec  rates.  The  subjects  significantly  gave  higher  estimates  of  performance  at 
the  .5-sec  rate  than  actual  measured  percent-correct  scores. 

SELF-RATED  STRESS 

25.  The  data  for  self-rated  stress  for  all  subjects  under  each  experimental 
condition  are  presented  in  figure  3.  Significant  main  effect  differences  were 
obtained  for  groups,  presentation  rates,  and  response  condition.  No  significant 
interactions  were  found  between  any  of  the  main  effect  variables  on  the  self-rated 
stress  measure. 


1.5  0.75  0.5  1.5  0.75  0.5  1.5  0.75  0.5 

PRESENTATION  RATE  [SECS] 


Figure  3 
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Figure  5 

Measured  Vocal  Stress  Deviations  from  Baseline 

30.  Presentation  rate  significantly  affected  deviations  from  baseline  stress  as 
measured  in  the  voice  across  all  response  conditions  (F  =  59*65,  df  =  2/54,  p  <*01) 
except  at  the  1.5-sec  presentation  rate.  Vocal  stress  significantly  increased  with 
simultaneous  key  pressing  across  rank-ordered  groups  and  presentation  rates 
F  =  8.96,  df  =  2/54,  p  c.Ol)  except  with  the  middle-ranked  group. 

31.  The  pretest  and  post  test  baseline  conditions  did  not  differ  in  vocal  stress. 
Also,  the  baseline  vocal  stress  measures  taken  during  self-paced  presentation  rates 
showed  no  consistent  trends  with  the  automatic-paced  presentation  rates.  That  is, 
a  subject  that  had  a  high  or  low  baseline  vocal  stress  level  did  not  have  a 
corresponding  high  or  low  vocal  stress  level  during  testing  conditions.  Therefore, 
the  pretest  baseline  vocal  stress  measure  could  not  be  used  as  a  predictor  of  the 
succeeding  vocal  stress  levels  during  presentation  rate  increases. 

CORRELATIONS 

32.  Spearman  rank-order  (Rho)  correlations  were  calculated  to  determine  if  any 
rank-order  relationships  existed  between  subjects  for  percent-correct  scores,  self- 
rated  stress,  and  vocal  stress  measures.  Table  I  presents  the  resultant  vocal  stress 
rank-order  correlations.  Significant  inverse  relationships  are  shown  to  exist 
between  vocal  stress  and  percent-correct  scores  at  the  .75-sec  and  .5-sec  rates, 
i.e.,  as  percent-correct  scores  decreased,  vocal  stress  increased.  There  were  no 
significant  correlations  under  "key  pressing  only”  response  conditions. 
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Table  I 

Rank-Order  Correlations 
(Vocal  Stress  X  %  Correct  Scores) 

VOCAL 

STRESS  %  CORRECT  SCORES 


VOICE 

+  KEY  PRESSES 

KEY  PRESSES  ONLY 

1.5  SEC 

0.75  SEC 

0.5  SEC 

1.5  SEC 

0.75  SEC  0.5  SEC 

VOICE 

ONLY 

-0.13 

-0.58* 

-0.51* 

0.03 

-0.46  -0.27 

VOICE  + 
KEY  PRESSES 

0.06 

-0.73** 

-0.68* 

-0.17 

-0.48  -0.34 

*RHO  =  0.506,  IM  —  12,  p^<  0.05  (ONE-TAILED  TEST] 

**RH0  =  0.712,  N  =  12,  p<  0.01  (ONE-TAILED  TEST] 

33.  Significant  rank-order  correlations  were  found  between  vocal  stress  and  self- 
rated  stress  as  shown  in  table  II  at  the  .75-sec  presentation  rate  but  only  when 
response  conditions  were  identical,  i.e.,  voice  only  or  voice  plus  key  presses.  For 
example,  vocal  stress  measurements  taken  during  voice  only  response  conditions 
showed  no  significant  rank-order  correlations  with  self-rated  stress  during  voice 
plus  key  pressing  response  conditions. 


VOCAL 

STRESS 


Table  H 

Rank-Order  Correlations 
(Vocal  Stress  X  Self-Rated  Stress) 


SELF-RATED  STRESS 


VOICE  ONLY  VOICE  +  KEY  PRESSES 

1.5  SEC  0.75  SEC  0.5  SEC  1.5  SEC  0.75  SEC  0.5  SEC 


VOICE 

ONLY 


0.14  0.56*  0.23  0.02  0.42 


0.02 


VOICE  + 
KEY  PRESSES 


0.04  0.44  0.18  0.16  0.60*  0.15 


*RHO  »  0.506,  N  =  12,  p  *<  0.05  (ONE-TAILED  TEST) 
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DISCUSSION 


34.  The  psychomotor  performance  results  of  this  study  are  consistent  with 
previous  findings  by  Kodalen  (1975)  and  Helm,  Fishburne,  and  Waag  (1974)  in  using 
the  RATER  to  deteriorate  performance  by  increasing  the  presentation  rate.  That 
is,  percent-correct  scores  decreased  and  self-rated  stress  increased  as  the  require¬ 
ment  to  process  bits  of  information  increased.  However,  the  uniqueness  of  this 
study  has  shown  that  an  objective,  reliable,  and  sensitive  measure  can  be  utilized 
to  study  the  underlying  relationship  between  stress  as  manifested  in  the  voice  and 
performance  on  a  psychomotor  task. 

35.  The  objectivity  of  this  vocal  measure  has  been  established  by  identifying  and 
quantifying  the  degree  of  correspondence  that  performance  has  with  a  subjective 
pattern  recognition  criterion  of  uniformity.  In  addition,  this  numerical  rating 
technique  was  used  to  reliably  indicate  the  degree  of  sensitivity  present  in  the 
measure  by  scaling  a  range  of  uniformity  over  a  wide  spectrum  of  simulated-stress, 
waveform  patterns. 

36.  These  data  indicate  that  the  most  predominate  variable  to  affect  all  subjects 
under  each  response  mode  for  each  dependent  measure  was  presentation  rate. 
However,  a  closer  examination  of  the  data  reveals  that  almost  all  of  the  dependent 
measures  that  reflected  significant  differences  between  groups  did  so  only  at  the 
.75-sec  presentation  rate.  This  selective  effect  was  due  primarily  to  understimula¬ 
tion  at  the  1.5-sec  rate  and  overstimulation  at  the  .5-sec  rate  that  produced  an 
overload  information  processing  situation.  The  percent-correct  scores  and  self- 
rated  stress  estimates  support  the  conclusion  that  the  slowest  and  fastest  rates 
narrowed  the  sensitivity  range  of  these  dependent  measures  to  the  extent  that  no 
discrimination  could  occur  between  the  subjects,  e.g.,  self-rated  stress  estimates 
were  approximately  the  same.  However,  the  vocal  stress  measure  significantly 
differentiated  between  subjects  at  both  the  .75-sec  and  .50-sec  rates  which 
indicates  a  more  sensitive  measurement  technique. 

37.  The  sensitivity  of  the  vocal  measure  to  discriminate  between  the  performance 
of  subjects  even  when  the  percent-correct  score  range  of  20.5%  was  narrow  (38% 
maximum  and  17.5%  minimum)  was  further  demonstrated  by  a  highly  significant 
correlation  (Rho  = -.69)  when  compared  to  the  nonsignificant  correlation  of  the 
self-rated  stress  estimates  at  the  .50-sec  rate.  This  lack  of  correlation  is  an 
inherent  limitation  imposed  by  any  rating  scale  that  is  anchored  at  the  extremes  by 
either  a  maximum  or  minimum  value.  That  is,  the  subjects  were  not  allowed  to 
manifest  the  absolute  degree  of  stress.  Likewise,  it  should  be  noted  that  vocal 
stress  correlations  with  percent-correct  scores  (Rho  =  -.73)  and  self-rated  stress 
estimates  (Rho  =  .60)  improved  at  the  .75-sec  rate.  The  corresponding  percent- 
correct  scores  also  increased  from  a  range  of  21%  to  a  range  of  52%  and  the  self- 
rated  estimates  increased  from  a  range  of  0  to  4  stress  units,  indicating  an 
improvement  to  discriminate  or  estimate  individual  performance.  The  degree  of 
sensitivity  of  the  vocal  stress  measure  is  an  important  finding  of  this  study. 

38.  The  correlational  data  also  clearly  indicate  that  performance  can  be  better 
predicted  from  vocal  stress  measures  or  self-rated  stress  when  the  measures  or 
ratings  are  taken  during  the  same  time  of  testing  and  involve  the  same  reponse 
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mode,  e.g.,  voice  mode  only.  Vocalizing  the  numbers  only  resulted  in  significantly 
lower  self-rated  stress  estimates  and  vocal  stress  measurements  than  when 
vocalizing  and  key  pressing  simultaneously.  The  data  clearly  reveal  that  key 
pressing  was  the  main  factor  that  increases  stress.  Whether  the  subjects  would 
show  a  corresponding  improvement  or  deterioration  in  vocal  performance  could  not 
be  evaluated  because  a  reliable  method  of  scoring  correct  verbal  responses  could 
not  be  devised  due  to  the  high  rate  of  vocalization,  e.g.,  .5  sec. 

39.  Another  weakness  in  the  study  that  limited  a  more  extensive,  statistical 
evaluation  was  the  large  number  of  data  points  per  subject  that  necessitated 
scoring  a  smaller  (N=12)  sample  than  desired.  The  results  and  conclusions  of  this 
study  should  be  interpreted  with  the  reminder  that  the  subjects  were  selected  to 
represent  the  extremes  and  middle  of  the  performance  continuum  for  the  purpose 
of  proper  scaling  the  sensitivity  of  a  new  measure.  It  is  presently  unknown  whether 
the  significant  high  correlations  found  in  this  study  would  remain  significant  with 
the  inclusion  of  the  remaining  30  subjects. 

40.  In  summary,  the  following  conclusions  and  recommendations  are  warranted 
based  on  the  results  of  this  study. 
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CONCLUSIONS 

41.  An  objective,  reliable,  and  sensitive  pat  tern -recognition -scoring  technique  has 
been  identified. 

42.  The  quantification  of  this  scoring  technique  has  been  automated. 

43.  The  relationship  of  vocal  stress  measures  to  percent-correct  scores  and  self- 
rated  stress  has  been  significantly  correlated. 

44.  A  potentially  useful  and  unobtrusive  test  and  evalaution  technique  has  been 
established. 
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RECOMMENDATIONS 

45.  Plan  collaborative  efforts  with  other  laboratories  attempting  to  validate  the 
Psychological  Stress  Evaluator. 

46.  Obtain  field  test  data  from  aircrew  tasks  that  require  vocalizations  over  a 
range  of  information  processing  loads. 

47.  Scale  the  selected  aircrew  tasks  by  degree  of  vocalized  stress. 

48.  Utilize  the  vocal  stress  measure  in  conjunction  with  secondary  task 
methodology. 
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GLOSSARY 

Glossary  of  terms  for  the  automatic  stress  scoring  program  flow  diagram: 

TRIGR  -  Trigger;  the  minimum  value  to  be  considered  a  signal.  All  values 
below  this  chosen  level  are  considered  to  be  noise  and  are  disregarded 
by  the  computer. 

ARRAY1  -  Storage  array  for  raw  data. 

ARRAY2  -  Storage  array  for  maximum  and  minimum  values  of  the  raw  data 
from  ARRAY1. 

ARRAY3  -  Storage  array  for  the  absolute  value  of  the  differences  between 
consecutive  values  in  ARRAY2,  i.e.,  "peak-to-peak"  values. 

ST1  -  Stress  1;  a  single  number  formed  by  taking  the  product  of  the 

differences  between  the  values  in  the  odd-numbered  locations  of 
ARRAY3.  This  value  is  related  to  the  stress  content  contributed  by 
either  the  positive  or  negative  slopes  of  the  waveform  and  is 
complimentary  to  ST2.  Absolute  determination  of  positive  or 
negative  slope  is  not  relevant,  just  magnitude  being  significant. 

ST2  -  Stress  2;  the  compliment  of  ST1. 

T1  -  Tendency  to  Stress  1;  a  more  workable  form  of  ST1.  T1  =  - log ^  STL 

Larger  values  of  T1  indicate  higher  stress. 

T2  -  Tendency  to  Stress  2;  a  more  workable  form  of  ST2.  T2  =  -log^  ST2. 

Larger  values  of  T2  indicate  higher  stress. 

AVES  -  Average  Stress;  the  average  of  T1  and  T2.  One  value  of  AVES  should 
be  calculated  per  each  analyzed  utterance. 
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AUTOMATIC  STRESS  SCORING  PROGRAM  FLOW  DIAGRAM 
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Wright  State  University,  Dayton 

(Dr.  F.  T. 'Eggemeier)  (1) 

Hughes  Helicopter,  Culver  City 

(Mr.  S.  A.  Thompson)  (1) 

Rockwell  International/Collins  Division, 

Cedar  Rapids  (Dr.  L.  Silverstein)  (1) 

Dr.  Jack  Laveson,  Annandale,  Va.  (1) 

Northern  Illinois  University,  Dekab 

(Dr.  T.  Wiegele)  (1) 

University  of  South  Dakota,  Vermillion 

(LCDR  W.  Helm)  (1) 

Dektor,  Incorporated,  Springfield 

(Ms.  P.  Gerner)  (3) 
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Systems  Research  Laboratories, 

Incorporated,  Dayton 

(Mr.  R.  Spicuzza)  (1) 

Bell  Helicopter  Company,  Fort  Worth 

(Dr.  D.  Strothers)  (1) 

University  of  Illinois,  Champaign 

(Dr.  C.  Wickens)  (1) 

Honeywell,  Incorporated,  Minneapolis 

(Dr.  R.  North)  (1) 

Systems  Research  Laboratories, 

Incorporated,  Dayton 

(Mr.  M.  Crabtree)  (1) 
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